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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Soderlund, David M. 

Knipple, Douglas C. 
Ingles, Patricia J. 

(ii) TITLE OF INVENTION: INSECT SODIUM CHANNELS FROM 

INSECTICIDE-SUSCEPTIBLE AND INSECTICIDE-RESISTANT 
HOUSE FLIES 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Nixon, Hargrave, Devans & Doyle LLP 
□ (B) STREET: P.O. Box 1051, Clinton Square 

J3 (C) CITY: Rochester 

(D) STATE: New York 

ITJ (E) COUNTRY: USA 

01 (F) ZIP: 14603 

iJ-J 

Kl (v) COMPUTER READABLE FORM: 

(2 (A) MEDIUM TYPE: Floppy disk 

\ (B) COMPUTER: IBM PC compatible 

\± (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

n (D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

Lji (vi) CURRENT APPLICATION DATA: 
~ (A) APPLICATION NUMBER: 

~ (B) FILING DATE: 

^ j (C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

* (A) APPLICATION NUMBER: US 08/608,618 

(B) FILING DATE: 01-MAR-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Braman, Susan J. 

(B) REGISTRATION NUMBER: 34,103 

(C) REFERENCE/DOCKET NUMBER: 19603/601 (CRF D-1657) 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 716-263-1636 

(B) TELEFAX: 716-263-1600 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 



1 
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(A) LENGTH: 6318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGACAGAAG ATTCCGACTC GATATCTGAG GAAGAACGCA GTTTGTTCCG TCCCTTCACC 6 0 

CGCGAATCAT TGTTACAAAT CGAACAACGT ATCGCTGAAC ATGAAAAACA AAAGGAGCTG 12 0 

GAAAGAAAGA GAGCCGCCGA AGGAGAGCAG ATACGATATG ATGACGAGGA CGAAGATGAA 18 0 

. f =• 

C^TCCACAGC CGGATCCCAC ACTTGAACAG GGTGTGCCTA TACCTGTTCG AATGCAGGGC 24 0 

ASCTTCCCGC CGGAATTGGC CTCCACTCCT CTCGAGGATA TCGATCCCTT CTACAGTAAT 3 00 

C?f ACTGACAT TTGTAGTAAT AAGTAAAGGA AAGGATATTT TTCGTTTTTC TGCCTCAAAA 3 60 

dr^AATGTGGC TGCTCGATCC ATTCAATCCG ATACGTCGTG TAGCCATTTA TATTTTAGTG 42 0 

dSTCCCTTGT TTTCGTTATT CATTATCACC ACTATTCTAA CTAATTGTAT TTTAATGATA 480 

i&GCCGACAA CGCCCACGGT CGAATCCACA GAGGTGATAT TCACCGGAAT CTACACATTT 54 0 

01 

GBATCAGCTG TTAAAGTGAT GGCACGAGGT TTCATTTTAT GCCCGTTTAC GTATCTTAGA 60 0 

GATGCATGGA ATTGGCTGGA CTTCGTAGTA ATAGCTTTAG CTTATGTGAC C ATGGG CAT A 6 60 

GATTTAGGTA ATCTCGCAGC TTTGAGAACA TTTAGGGTAC TGCGAGCTCT GAAAACCGTA 72 0 

GCCATTGTGC CAGGT CTAAA AACCATTGTC GGTGCTGTCA TTGAATCTGT AAAAAATCTA 78 0 

CGCGATGTGA TAATTTTGAC AATGTTTTCC CTGTCGGTGT TCGCGCTGAT GGGCCTACAA 84 0 

ATCTATATGG GTGTTCTAAC ACAAAAGTGC ATTAAACGAT TCCCCCTGGA CGGCAGTTGG 900 

GGCAATCTGA CCGATGAAAA CTGGTTTCTA CACAATAGCA ACAGTTCCAA TTGGTTTACG 96 0 

GAGAACGATG GCG AGT CAT A TCCGGTGTGC GGGAATGTAT CCGGTGCGGG ACAATGCGGC 102 0 

GAGGATTACG TCTGCCTGCA GGGCTTCGGC CCCAATCCCA ACTACGACTA CACCAGTTTC 10 8 0 

GATTCATTCG GTTGGGCTTT CCTGTCGGCG TTTCGTCTCA TGACCCAAGA TTTCTGGGAG 114 0 

GAT C TG TAT C AGCACGTGCT GCAAGCAGCT GGACCCTGGC ACATGTTGTT CTTTATAGTC 12 0 0 



J 
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ATCATCTTCC 


TAGGTTCATT 


CTATCTTGTG 


AATTTGATTT 


TGGCCATTGT 


TGCCATGTCT 


1260 


TATGACGAAT 


TGCAAAAGAA 


GGCCGAAGAA 


GAAGAGGCTG 


CCGAGGAGGA 


GGCGATACGA 


1320 


GAAGCTGAAG 


AAGCGGCAGC 


AGCCAAGGCG 


GCCAAACTGG 


AGGAGCGGGC 


CAATGTAGCA 


1380 


GCTCAAGCGG 


CTCAGGATGC 


AGCGGATGCC 


GCTGCGGCAG 


CTCTGCATCC 


CGAGATGGCA 


1440 


AAGAGTCCCA 


CGTACTCTTG 


CATTAGCTAT 


GAACTGTTTG 


TTGGCGGCGA 


GAAGGGCAAC 


1500 


GATGACAACA 


ACAAAGAGAA 


GATGTCCATA 


CGCAGCGTCG 


AAGTGGAATC 


GGAGTCGGTG 


1560 


AGCGTTATAC 


AAAGACAACC 


AGCACCTACC 


ACAGCACCCG 


CTACTAAAGT 


CCGTAAAGTT 


1620 


AGCACGACTT 


CCTTATCCTT 


ACCTGGTTCA 


CCATTTAACC 


TACGCCGGGG 


ATCACGTAGT 


1680 


jjTCACACAAGT 


ACACAATACG 


AAATGGGCGT 


GGACGTTTTG 


GT AT AC C AGG 


TAGCGATCGC 


1740 


SIaagccattgg 


TACTGCAAAC 


ATATCAGGAT 


GCCCAGCAGC 


ATTTGCCCTA 


TGCCGATGAC 


1800 


riTCGAATGCCG 


TAACACCAAT 


GTCCGAAGAG 


AATGGTGCCA 


TTATAGTACC 


AGCCTACTAT 


1860 


/rTGTAATTTAG 


GTTCTAGACA 


TTCTTCATAT 


ACCTCGCATC 


AATCAAGAAT 


CTCGTATACA 


1920 


^ T C AC ATGGTG 


ATTTATTGGG 


TGGCATGGCG 


GCCATGGGTG 


CCAGCACAAT 


G AC C AAAG AG 


1980 


^AGCAAATTGC 


GCAGTCGCAA 


CACACGCAAT 


CAATCAATCG 


GTGCTGCAAC 


CAATGGTGGC 


2040 


Ojagtagtacgg 


CTGGTGGTGG 


CT AT C C CG AT 


GCCAATCACA 


AGGAACAAAG 


GGATTATGAA 


2100 


03ATGGGTCAGG 


ATTATACAGA 


CGAAGCTGGC 


AAAATAAAAC 


ACCACGACAA 


TCCTTTTATC 


2160 


GAGCCCGTCC 


AAACTCAAAC 


AGTGGTAGAC 


ATGAAAGATG 


TTATGGTCTT 


AAATGAT AT C 


2220 


ATTGAACAAG 


CCGCTGGTCG 


GCATAGTCGT 


GCTAGTGAAC 


GAGGTGAGGA 


CGATGACGAA 


2280 


GATGGTCCCA 


CATTCAAGGA 


CATCGCCCTC 


GAATACATCC 


T AAAAGG CAT 


CGAAATCTTT 


2340 


TGTGTATGGG 


ACTGTTGTTG 


GGTGTGGTTA 


AAATTTCAGG 


AATGGGTGTC 


CTTTATTGTG 


2400 


TTCGATCCAT 


TCGTGGAGCT 


CTTCATTACC 


CTGTGTATTG 


TGGTCAATAC 


GATGTTTATG 


2460 


GCCATGGATC 


ATCACGACAT 


GAATCCGGAA 


TTAGAGAAGG 


TGCTGAAAAG 


TGGTAACTAT 


2520 


TTCTTCACGG 


CCACTTTTGC 


AATTGAAGCC 


AGCATGAAAC 


TGATGGCCAT 


GAGCCCGAAG 


2580 


TACTACTTCC 


AGGAAGGCTG 


GAACATTTTC 


GATTTCATTA 


TTGTGGCCTT 


GTCTCTGCTG 


2640 
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GAATTGGGCC 


TGGAGGGTGT 


CCAGGGCCTG 


TCGGTGTTGA 


GAAGTTTTCG 


TTTGCTTCGT 


2700 


GTATTCAAAT 


TGGCAAAATC 


ATGGCCCACA 


CTCAATTTAC 


TCATTTCGAT 


TATGGGCCGG 


2760 


ACAATGGGTG 


CATTGGGTAA 


TCTGACATTT 


GTACTTTGCA 


TT AT CAT CTT 


CATCTTTGCC 


2820 


GTGATGGGAA 


TGCAACTTTT 


CGGAAAGAAC 


TATATTGACC 


AC AAGG AT CG 


CTTCAAGGAC 


2880 


CATGAATTAC 


CGCGCTGGAA 


CTTCACCGAC 


TTCATGCACA 


GCTTCATGAT 


TGTGTTCCGA 


2940 


GTGCTGTGCG 


GAGAGTGGAT 


CGAGTCCATG 


TGGGACTGCA 


TGTATGTGGG 


CGATGTCAGC 


3000 


TGTATACCCT 


TCTTCTTGGC 


CACGGTCGTG 


ATAGGCAATC 


TTGTGGTTCT 


TAATCTTTTC 


3060 


TTAGCTTTGC 


TTTTGTCCAA 


CTTCGGTTCA 


TCTAGTTTAT 


CAGCCCCGAC 


TGCCGACAAT 


3120 


Q GATACCAATA 


AAATAGCAGA 


GGCCTTCAAT 


CGTATTGCTC 


GTTTTAAGAA 


CTGGGTGAAA 


3180 


j- CGTAATATTG 


CCGATTGTTT 


TAAGTTAATT 


CGAAATAAAT 


TGACAAATCA 


AATAAGTGAC 


3240 


ru 

q CAACCATCAG 


AACATGGCGA 


TAATGAACTG 


GAGTTGGGTC 


ATGACGAAAT 


CATGGGCGAT 


3300 


hi 

rj GGCTTGATCA 


AAAAGGGTAT 


GAAGGGCGAG 


ACCCAGCTGG 


AGGTGGCCAT 


TGGCGATGGC 


3360 


ATGGAGTTCA 


CGATACATGG 


CGATATGAAA 


AACAACAAGC 


CGAAGAAATC 


AAAATTCATG 


3420 


ll AACAACACAA 


CGATGATTGG 


AAACTCAATA 


AACCACCAAG 


ACAATAGACT 


GGAACATGAG 


3480 


^ CTAAACCATA 


GAGGTTTGTC 


CATACAGGAC 


GATGACACTG 


CCAGCATTAA 


CTCATATGGT 


3540 


^0 AGCCATAAGA 


AT CG AC CAT T 


CAAGGACGAG 


AGCCACAAGG 


GCAGCGCCGA 


GACCATCGAG 


3600 


GGCGAGGAGA 


AACGCGACGT 


CAGCAAAGAG 


GACCTCGGCC 


TCGACGAGGA 


ACTGGACGAG 


3660 


GAGGCCGAGG 


GCGATGAGGG 


CCAGCTGGAT 


GGTGACATTA 


TCATTCATGC 


GCAAAACGAC 


3720 


GACGAGATAA 


TCGACGACTA 


TCCGGCCGAC 


TGTTTCCCCG 


ACTCGTACTA 


CAAGAAGTTT 


3780 


CCGATCTTGG 


C CGGCG AC G A 


GGACTCGCCG 


TTCTGGCAAG 


GATGGGGCAA 


TTTACGACTG 


3840 


AAAACTTTTC 


AATTAATTGA 


AAATAAATAT 


TTTGAAACCG 


CAGTTATCAC 


TATGATTTTA 


3900 


ATGAGTAGCT 


TAGCTTTGGC 


CTTAGAAGAT 


GTTCATTTAC 


CCGATCGACC 


TGTCATGCAG 


3960 


GATATACTGT 


ACTACATGGA 


CAGGATATTT 


ACGGTGATAT 


TCTTTTTGGA 


GATGTTGATC 


4020 


AAATGGTTGG 


CCCTGGGCTT 


TAAGGTTTAC 


TTCACCAATG 


CCTGGTGTTG 


GCTGGATTTC 


4080 



t 
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GTGATTGTCA 


TGCTATCGCT 


TATAAATTTG 


GTTGCCGTTT 


GGTCGGGCTT 


AAATGATATA 


4140 


GCCGTGTTTA 


GATCAATGCG 


CACACTGCGC 


GCCCTAAGGC 


CATTGCGTGC 


TGTCTCTAGA 


4200 


TGGGAGGGTA 


TGAAAGTTGT 


CGTGAATGCG 


CTGGTTCAAG 


CTATACCGTC 


CATCTTCAAT 


4260 


GTGCTATTGG 


TGTGTCTGAT 


ATTTTGGCTT 


ATTTTTGCCA 


TTATGGGAGT 


ACAGCTTTTT 


4320 


GCTGGAAAAT 


ATTTTAAGTG 


TAAAGATGGT 


AATGACACTG 


TGCTGAGCCA 


TGAAAT CAT A 


4380 


CCGAATCGTA 


ATGCCTGCAA 


AAGTGAAAAC 


TACACCTGGG 


AAAATTCGGC 


AATGAACTTC 


4440 


GATCATGTAG 


GTAATGCGTA 


TCTCTGTCTA 


TTTCAAGTGG 


CCACCTTTAA 


GGGCTGGATC 


4500 


CAGATTATGA 


ACGATGCCAT 


TGATTCACGA 


GAGGTGGACA 


AGCAGCCGAT 


CCGAGAAACC 


4560 


AATATCTACA 
JAATCTGTTCA 


TGTATTTATA 


TTTCGTATTC 


TTCATTATAT 


TTGGATCATT 


TTTCACACTC 


4620 


TTGGTGTTAT 


CATTGATAAT 


TTTAATGAAC 


AAAAGAAGAA 


AGCTGGTGGA 


4680 


rtrCATTAGAAA 


TGTTCATGAC 


AGAAGATCAG 


AAAAAGTACT 


ATAATGCTAT 


GAAAAAGATG 


4740 


cs 

ySGCTCTAAAA 


AACCATTAAA 


AGCCATTCCA 


AGACCGAGGT 


GGCGACCACA 


AGCAATAGTA 


4800 


LiTTCGAAATAG 


TTACAGATAA 


AAAATTCGAT 


ATAATCATTA 


TGTTGTTCAT 


TGGCTTAAAC 


4860 


jjVTGTTTACCA 


TGACCCTCGA 


TCGGTACGAC 


GCCTCCGAGG 


CGTACAACAA 


TGTCCTCGAC 


4920 


tkAACTCAATG 


GGATATTCGT 


AGTTATTTTC 


AGTGGCGAAT 


GTCTATTAAA 


AATATTCGCT 


4980 


[iTACGATATC 


ACTATTTCAA 


AGAGCCATGG 


AATTTATTTG 


ATGTAGTAGT 


TGTCATTTTA 


5040 


^RrCCATCTTAG 


GTCTTGTACT 


CAGCGACATC 


ATTGAGAAGT 


ATTTCGTATC 


GCCGACACTG 


5100 


CTCCGTGTGG 


TGAGAGTGGC 


CAAAGTGGGT 


CGTGTCCTGC 


GTTTAGTCAA 


GGGTGCCAAG 


5160 


GGTATCCGGA 


CGTTGCTGTT 


CGCGTTAGCC 


ATGTCGTTGC 


CTGCCTTATT 


CAACATTTGT 


5220 


CTGTTGCTGT 


TCTTGGTGAT 


GTTCATCTTT 


GCT AT CTTTG 


GCATGTCCTT 


CTTCATGCAT 


5280 


GTCAAAGAGA 


AGAGCGGCAT 


AAATGCTGTG 


TATAATTTTA 


AGACATTTGG 


C C AAAGT ATG 


5340 


ATATTGCTGT 


T T C AG ATGT C 


TACCTCAGCC 


GGTTGGGATG 


GTGTGTTAGA 


TGCCATTATC 


5400 


AATGAGGAAG 


ATTGCGATCC 


ACCCGACAAC 


GACAAGGGCT 


ATCCGGGCAA 


TTGTGGTTCA 


5460 


GCGACTGTTG 


GAATTACGTT 


TCTCCTTTCA 


TATCTAGTTA 


TAAGCTTTTT 


GATAGTTATT 


5520 
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AATATGTACA 


TTGCTGTCAT 


TCTCGAGAAC 


TATAGCCAGG 


CTACGGAGGA 


TGTACAGGAG 


5580 


GGTCTCACCG 


ACGAGGATTA 


GGATATGTAC 


TACGAGATTT 


GGCAACAATT 


CGATCCGGAG 


5640 


GGCACCCAGT 


ACATACGCTA 


CGACCAGCTG 


TCCGAGTTTC 


TGGACGTGCT 


GGAGCCGCCG 


5700 


CTGCAGATCC 


ACAAGCCGAA 


CAAGTACAAA 


ATCATATCGA 


TGGACATGCC 


GATATGTCGG 


5760 


GGCGACATGA 


TGTACTGTGT 


GGATATATTG 


GATGCCCTGA 


CCAAGGACTT 


CTTTGCGCGC 


5820 


AAGGGTAATC 


CGAT CGAGG A 


GACGGGTGAA 


ATTGGTGAGA 


TAGCGGCGCG 


ACCGGACACC 


5880 


GAGGGCTATG 


ATCGGGTGTC 


GTCAACACTG 


TGGCGCCAGC 


GTGAGGAGTA 


CTGCGCCAAG 


5940 


CTGATACAGA 


ATGCGTGGCG 


GCGTTACAAG 


AATGGCCCAC 


CCCAGGAGGG 


TGATGAGGGC 


6000 


GAGGCGGCTG 


GTGGCGAAGA 


TGGTGCTGAA 


GGCGGTGAGG 


GTGAAGGAGG 


CAGCGGCGGC 


6060 


JJggcggcggtg 


ATGATGGTGG 


CTCAGCGACA 


GGAGCAACGG 


CGGCGGCGGG 


AGCCACATCA 


6120 


f ; CCCTCAGATC 

ni 


CAGATGCCGG 


CGAAGCAGAT 


GGTGCCAGCG 


TCGGCGGCCC 


CCTTAGTCCG 


6180 


^ GGCTGTGTTA 

UJ 


GTGGCGGCAG 


TAATGGCCGC 


CAAACGGCCG 


TACTGGTCGA 


AAGCGATGGT 


6240 


y TTTGTTACAA 


AAAACGGTCA 


TAAGGTTGTA 


ATACACTCGA 


GATCGCCGAG 


C AT AAC AT C C 


6300 


- AGGACGGCAG 


ATGTCTGA 










6318 



p (2) INFORMATION FOR SEQ ID NO : 2 : 

fU 

m (i) SEQUENCE CHARACTERISTICS: 

fp (A) LENGTH: 6315 base pairs 

JS (B) TYPE: nucleic acid 

w (C) STRANDEDNESS : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ATGACAGAAG ATTCCGACTC GATATCTGAG GAAGAACGCA GTTTGTTCCG TCCCTTCACC 
CG CGAAT CAT TGTTACAAAT CGAACAACGT ATCQCTGAAC ATGAAAAACA AAAGGAGCTG 



60 
120 
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GAAAGAAAGA 


GAGCCGCCGA 


AGGAGAGCAG 


ATACGATATG 


ATGACGAGGA 


CGAAGATGAA 


180 


GGTCCACAGC 


CGGATCCCAC 


ACTTGAACAG 


GGTGTGCCTA 


TACCTGTTCG 


AATGCAGGGC 


240 


AGCTTCCCGC 


CGGAATTGGC 


CTCCACTCCT 


CTCGAGGATA 


TCGATCCCTT 


CTACAGTAAT 


300 


GTACTGACAT 


TTGTAGTAAT 


AAGTAAAGGA 


AAGGATATTT 


TTCGTTTTTC 


TGCCTCAAAA 


360 


GCAATGTGGC 


TGCTCGATCC 


ATTCAATCCG 


ATACGTCGTG 


TAGCCATTTA 


TATTTTAGTG 


420 


CATCCCTTGT 


TTTCGTTATT 


CATTATCACC 


ACTATTCTAA 


CTAATTGTAT 


TTTAATGATA 


480 


ATGCCGACAA 


CGCCCACGGT 


CGAATCCACA 


GAGGTGATAT 


TCACCGGAAT 


CTACACATTT 


540 


GAATCAGCTG 


TTAAAGTGAT 


GGCACGAGGT 


TTCATTTTAT 


GCCCGTTTAC 


GTATCTTAGA 


600 


.J3ATGCATGGA 


ATTGGCTGGA 


CTTCGTAGTA 


ATAGCTTTAG 


CTTATGTGAC 


CATGGGCATA 


660 


^gSATTTAGGTA 


ATCTCGCAGC 


TTTGAGAACA 


TTTAGGGTAC 


TGCGAGCTCT 


GAAAACCGTA 


720 


^bcCATTGTGC 

fc*5? 

UJpGCGATGTGA 


C AGGT CT AAA 


AACCATTGTC 


GGTGCTGTCA 


TTGAATCTGT 


AAAAAATCTA 


780 


TAATTTTGAC 


AATGTTTTCC 


CTGTCGGTGT 


TCGCGCTGAT 


GGGCCTACAA 


840 


HATCTATATGG 


GTGTTCTAAC 


AC AAAAGTG C 


ATTAAACGAT 


TCCCCCTGGA 


CGGCAGTTGG 


900 


5 

mggcaatctga 


CCGATGAAAA 


CTGGTTTCTA 


CACAATAGCA 


AC AGT T C C AA 


TTGGTTTACG 


960 


ftGAGAACGATG 


G CG AGT CAT A 


TCCGGTGTGC 


GGGAATGTAT 


CCGGTGCGGG 


ACAATGCGGC 


1020 


jJjAAGATTACG 


TCTGCCTGCA 


GGGCTTCGGC 


CCCAATCCCA 


ACTACGACTA 


CACCAGTTTC 


1080 


^GACTCATTCG 


GTTGGGCTTT 


CCTGTCGGCG 


TTTCGTCTCA 


TGACCCAAGA 


TTTCTGGGAG 


1140 


GATCTGTATC 


AGCACGTGCT 


GCAAGCAGCT 


GGACCCTGGC 


ACATGTTGTT 


CTTTATAGTC 


1200 


AT CATCTTCC 


TAGGTTCATT 


CTATCTTGTG 


AATTTGATTT 


TGGCCATTGT 


TGCCATGTCT 


1260 


TATGACGAAT 


TGCAAAAGAA 


GGCCGAAGAA 


GAAGAGGCTG 


CCGAGGAGGA 


GG CG AT C CG A 


1320 


GAAGCTGAAG 


AAGCGGCAGC 


AGCCAAGGCG 


GCCAAACTGG 


AGGAGCGGGC 


CAATGTAGCA 


1380 


GCTCAAGCGG 


CTCAGGATGC 


AGCGGATGCC 


GCTGCGGCAG 


CTCTGCATCC 


CG AG AT GG C A 


1440 


AAGAGTCCCA 


CGTACTCTTG 


CATTAGCTAT 


GAACTGTTTG 


TTGGCGGCGA 


GAAGGGCAAC 


1500 


GATGACAACA 


ACAAGGAGAA 


GATGTCGATA 


CGCAGCGTCG 


AAGTGGAATC 


GGAGTCGGTG 


1560 
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AGCGTTATAC 


AAAGACAACC 


AGCACCTACC 


ACAGCACCCG 


CTACTAAAGT 


CCGTAAAGTT 


1620 


AGCACGACTT 


CCTTATCCTT 


ACCTGGTTCA 


CCATTTAACC 


TACGCCGGGG 


ATCACGTAGT 


1680 


TCACACAAGT 


ACACAATACG 


AAATGGGCGT 


GGACGTTTTG 


GTATACCAGG 


TAGCGATCGC 


1740 


AAGCCATTGG 


TACTGCAAAC 


ATATCAGGAT 


GCCCAGCAGC 


ATTTGCCCTA 


TGCCGATGAC 


1800 


TCGAATGCCG 


TAACACCAAT 


GTCCGAAGAG 


AATGGTGCCA 


TTATAGTACC 


AGCCTACTAT 


1860 


TGTAATTTAG 


GTT CT AG AC A 


TTCTTCATAT 


ACCTCGCATC 


AATCAAGAAT 


CTCGTATACA 


1920 


T C AC ATGGTG 


ATTTATTGGG 


TGGCATGGCG 


GCCATGGGTG 


CCAGCACAAT 


GACCAAAGAG 


1980 


AGCAAATTGC 


GCAGTCGCAA 


CACACGCAAT 


CAATCAATCG 


GTGCTGCAAC 


CAATGGTGGC 


2040 


AGTAGTACGG 


CCGGTGGTGG 


CTATCCCGAT 


GCCAATCACA 


AGGAACAAAG 


GGATTATGAA 


2100 


J^TGGGTCAGG 


ATTATACAGA 


CGAAGCTGGC 


AAAATAAAAC 


ACCACGACAA 


TCCTTTTATC 


2160 


nGAGCCCGTCC 


AAACTCAAAC 


AGTGGTAGAC 


ATGAAAGATG 


TTATGGTCTT 


AAATGAT AT C 


2220 


i-XJ. 

^ATTGAACAAG 


CCGCTGGTCG 


GCATAGTCGT 


GCTAGTGAAC 


GAGGTGAGGA 


CGATGACGAA 


2280 


yGATGGTCCCA 


CATTCAAGGA 


CATCGCCCTC 


GAATATATCC 


TAAAAGGCAT 


CGAAATCTTT 


2340 


Ltgtgtatggg 


ACTGTTGTTG 


GGTGTGGTTA 


AAATTTCAGG 


AATGGGTCTC 


CTTTATTGTG 


2400 


Wttcgat c cat 


TCGTGGAGCT 


CTTCATTACC 


CTGTGTATTG 


TGGTCAATAC 


AATGTTCATG 


2460 


^ccatggatc 


AT C ACG AC AT 


GAATCCGGAA 


TTGGAGAAGG 


TGCTGAAAAG 


TGGTAACTAT 


2520 


-ttcttcacgg 


CCACTTTTGC 


AATTGAGGCC 


AGCATGAAAC 


TGATGGCCAT 


GAGCCCGAAG 


2580 


TACTACTTCC 


AGGAAGGCTG 


GAACATTTTC 


GATTTCATTA 


TTGTGGCCTT 


GTCTCTGCTG 


2640 


GAATTGGGCC 


' TGGAGGGTGT 


CCAGGGCCTG 


TCGGTGTTGA 


GAAGTTTTCG 


TTTGCTTCGT 


2700 


GTATTCAAAT 


TGGCAAAATC 


ATGGCCCACA 


CTGAATTTAC 


TCATTTCGAT 


TATGGGCCGG 


2760 


ACAATGGGTG 


CATTGGGTAA 


TCTGACATTT 


GTACTTTGCA 


TTATCATCTT 


CATCTTTGCC 


2820 


GTGATGGGAA 


TGCAACTTTT 


CGGAAAGAAC 


TATATTGACC 


AC AAGG AT CG 


CTTCAAGGAC 


2880 


CATGAATTAC 


CGCGCTGGAA 


TTTCACCGAC 


TTCATGCACA 


GCTTCATGAT 


TGTGTTCCGA 


2940 


GTGCTGTGCG 


GAGAGTGGAT 


CGAGTCCATG 


TGGGACTGCA 


TGTATGTGGG 


CGATGTCAGC 


3000 
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TGTATACCCT 


TCTTCTTGGC 


CACGGTCGTG 


ATCGGCAATT 


TTGTGGTTCT 


TAATCTTTTC 


3060 


TTAGCTTTGC 


TTTTGTCCAA 


CTTCGGTTCA 


TCTAGTTTAT 


CAGCCCCGAC 


TGCCGACAAT 


3120 


GATACCAATA 


AAATAGCAGA 


GGCCTTCAAT 


CGTATTGCTC 


GTTTTAAGAA 


CTGGGTGAAA 


3180 


CGTAATATTG 


CCGATTGTTT 


TAAGTTAATT 


CGAAATAAAT 


TGACAAATCA 


AATAAGTGAC 


3240 


CAACCATCAG 


AACATGGCGA 


TAATGAACTG 


GAGTTGGGTC 


ATGACGAAAT 


CATGGGCGAT 


3300 


GGCTTGATCA 


AAAAGGGTAT 


GAAGGGCGAG 


ACCCAGCTGG 


AGGTGGC CAT 


TGGCGATGGC 


3360 


ATGGAGTTCA 


CGATACATGG 


CGATATGAAA 


AACAACAAGC 


CCAAGAAATC 


AAAATT CAT A 


3420 


AACAACACAA 


CGATGATTGG 


AAACTCAATA 


AACCACCAAG 


ACAATAGACT 


GGAACATGAG 


3480 


CJCTAAACCATA 


GAGGTTTGTC 


CATACAGGAC 


GATGACACTG 


CCAGCATTAA 


CTCATATGGT 


3540 


JfAGCCATAAGA AT CGAC C ATT 


CAAGGACGAG 


AG CC AC AAGG 


GCAGCGCCGA 


G A C CAT C GAG 


3600 


iy 

gJGGCGAGGAGA 


AACGCGACGT 


CAGCAAAGAG 


GACCTCGGCC 


TCGACGAGGA 


ACTGGACGAG 


3660 


%Jg AGG C CG AGG 


GCGATGAGGG 


CCAGCTGGAT 


GGTGACATCA 


TCATTCATGC 


CCAAAACGAC 


3720 


< } GACGAGATAA 


T CG ACG ACT A 


TCCGGCCGAC 


TGTTTCCCCG 


ACTCGTACTA 


CAAGAAGTTT 


3780 


tjcCGATCTTGG 


CCGGCGACGA 


GGACTCGCCG 


TTCTGGCAAG 


GATGGGGCAA 


TTTACGACTG 


3840 


^AAAACTTTTO 


AATTAATTGA 


AAATAAATAT 


TTTGAAACCG 


CAGTTATCAC 


TATGATTTTA 


3900 


'^ATGAGTAGCT 


TAGCTTTGGC 


CTTAGAAGAT 


GTTCATTTAC 


CCGATCGACC 


TGTCATGCAG 


3960 


GATATACTGT 


ACTACATGGA 


CAGGATATTT 


ACGGTGATAT 


TCTTTTTGGA 


G ATGTTG AT C 


4020 


AAATGGTTGG 


CCCTGGGCTT 


TAAGGTCTAC 


TTCACCAATG 


CCTGGTGTTG 


GCTGGATTT C 


4080 


GTGATTGTCA 


TGCTATCGCT 


TATAAATTTG 


GTTGCCGTTT 


GGTCGGGCTT 


AAATGATATA 


4140 


GCCGTGTTTA 


GATCAATGCG 


CACACTGCGC 


GCCCTAAGGC 


CATTGCGTGC 


TGTCTCTAGA 


4200 


TGGGAGGGTA 


TGAAAGTTGT 


CGTGAATGCG 


CTGGTTCAAG 


CTATACCGTC 


CATCTTCAAT 


4260 


GTGCTATTGG 


TGTGTCTGAT 


ATTTTGGCTT 


ATTTTTGCCA 


TTATGGGAGT 


ACAGCTTTTT 


4320 


GCTGGAAAAT 


ATTTTAAGTG 


TAAAGATGGT 


AATGACACTG 


TGCTGAGCCA 


TG AAAT CAT A 


4380 


CCGAATCGTA 


ATGCCTGCAA 


AAGTGAAAAC 


TACACCTGGG 


AAAATTCGGC 


AATGAACTTC 


4440 
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GATCATGTAG 


GTAATGCGTA 


TCTCTGTCTA 


TTTCAAGTGG 


CCACCTTTAA 


GGGCTGCATC 


4500 


CAGATTATGA 


ACG ATG C CAT 


TG ATT C ACG A 


GAGGTGGACA 


AGCAGCCGAT 


CCGAGAAACC 


4560 


AATATCTACA 


TGTATTTATA 


TTTCGTATTC 


TTCATTATAT 


TTGGATCATT 


TTTCACACTC 


4620 


AATCTGTTCA 


TTGGTGTTAT 


CATTGATAAT 


TTTAATGAAC 


AAAAGAAGAA 


AGCAGGTGGA 


4680 


TCATTAGAAA 


TGTTCATGAC 


AGAAGATCAG 


AAAAAGTACT 


ATAATGCTAT 


GAAAAAGATG 


4740 


GGCTCTAAAA 


AACCATTAAA 


AGCCATTCCA 


AGACCGAGGT 


GGCGACCACA 


AGCAATAGTA 


4800 


TTCGAAATAG 


TTACAGATAA 


AAAATTCGAT 


ATAATCATTA 


TGTTGTT CAT 


TG G C T T AAAC 


4860 


ATGTTTACCA 


TGACCCTCGA 


TCGGTACGAC 


GCCTCCGAGG 


CGTACAACAA 


TGTCCTCGAC 


4920 


y^AACTCAATG GGATATTCGT AGTTATTTTC 


AGTGGCGAAT 


GTCTATTAAA 


AATATTCGCT 


4980 


JtTTACGATATC ACTATTTCAA 


AGAGCCATGG 


AATTTATTTG 


ATGTAGTAGT 


TGTCATTTTA 


5040 


rtTCCATCTTAG 


GTCTTGTACT 


CAGCGACATC 


ATTGAGAAGT 


ATTTCGTATC 


GCCGACACTG 


5100 


N bTCCGTGTGG 


TGAGAGTGGC 


CAAAGTGGGT 


CGTGTCCTGC 


GTTTAGTCAA 


GGGTGCCAAG 


5160 


> GGTATCCGGA 


CGTTGCTGTT 


CGCGTTAGCC 


ATGTCGTTGC 


CTGCCTTATT 


CAACATTTGT 


5220 


Gctgttgctgt 


TCTTGGTGAT 


GTTCATCTTT 


GCTATCTTTG 


GCATGTCCTT 


CTTCATGCAT 


5280 


OjGTCAAAGAGA 


AGAGCGGCAT 


AAATGCTGTG 


TATAATTTTA 


AGACATTTGG 


CCAAAGTATG 


5340 


J2ATATTGCTGT 


TTCAGATGTC 


TACCTCAGCC 


GGTTGGGATG 


GTGTGTTAGA 


TGCCATTATC 


5400 


AATGAGGAAG 


ATTGCGATCC 


ACCCGACAAC 


GACAAGGGCT 


ATCCGGGCAA 


TTGTGGTTCA 


5460 


GCGACTGTTG 


GAATTACGTT 


TCTCCTTTCA 


TATCTAGTTA 


TAAGCTTTTT 


GATAGTTATT 


5520 


AATATGTACA 


TTGCTGTCAT 


TCTCGAGAAC 


TATAGCCAGG 


CTACGGAGGA 


TGTACAGGAG 


5580 


GGTCTCACCG 


ACGACGACTA 


TGATATGTAC 


TACGAGATTT 


GGCAACAATT 


CGATCCGGAG 


5640 


GGTACCCAGT 


ACATAAGATA 


CGACCAGCTG 


TCCGAGTTCC 


TGGACGTGCT 


GGAGCCGCCG 


5700 


C TG C AG AT C C 


ACAAGCCGAA 


CAAGTACAAA 


ATCATATCGA 


TGGACATGCC 


GATATGTCGG 


5760 


GGCGACATGA 


TGTACTGTGT 


GGATATATTG 


GATGCCCTGA 


CCAAGGACTT 


CTTTGCGCGC 


5820 


AAGGGTAATC 


CGATCGAGGA 


. GACGGGTGAA 


ATTGGTGAGA 


TTGCGGCGCG 


ACCGGACACC 


5880 
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GAGGGCTATG ATCCGGTGTC GTCGACACTG TGGCGCCAGC GTGAGGAGTA CTGCGCCAAG 5 94 0 

CTGATACAGA ATGCGTGGCG GCGTTACAAG AATGG C C C AC CCCAGGAGGG TGATGAGGGC 6 000 

GAGGCGGCTG GTGGCGAAGA TGGTGCTGAA GGCGGTGAGG GTGAAGGCGG CAGCGGCGGC 6060 

GGCGGCGATG ATGATGGTGG CTCAGCGACG GCGGCGGGAG CCACATCACC CACAGATCCA 612 0 

GATGCCGGCG AAGCAGATGG TGCCAGCGCC GGCAATGGTG GCGGCCCCCT TAGTCCGGGC 618 0 

TGTGTTAGTG GCGGCAGTAA TGGCCGCCAA ACGGCCGTAC TGGTCGAAAG CGATGGTTTT 624 0 

GTTACAAAAA ACGGTCATAA GGTTGTAATA CACTCGAGAT CGCCGAGCAT AACATCCAGG 63 00 

ACGGCAGATG TCTGA 6 315 
n(2) INFORMATION FOR SEQ ID NO : 3 : 



\ y 
U 

u 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
1 5 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Leu Gin lie Glu Gin Arg lie Ala 
20 25 30 

Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Ala Glu Gly 
35 40 45 

Glu Gin lie Arg Tyr Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro 
50 55 60 

Asp Pro Thr Leu Glu Gin Gly Val Pro lie Pro Val Arg Met Gin Gly 
65 70 75 80 

Ser Phe Pro Pro Glu Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro 

85 90 95 



O 
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Phe Tyr Ser Asn Val Leu Thr Phe Val Val lie Ser Lys Gly Lys Asp 
100 105 110 

lie Phe Arg Phe Ser Ala Ser Lys Ala Met Trp Leu Leu Asp Pro Phe 
115 120 125 

Asn Pro lie Arg Arg Val Ala lie Tyr lie Leu Val His Pro Leu Phe 
130 135 140 

Ser Leu Phe lie lie Thr Thr lie Leu Thr Asn Cys lie Leu Met lie 
i45 150 155 160 

Met Pro Thr Thr Pro Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly 

165 170 175 

lie Tyr Thr Phe Glu Ser Ala Val Lys Val Met Ala Arg Gly Phe lie 
180 185 190 

feO Leu Cys Pro Phe Thr Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe 

£ 195 200 205 



63 



Val Val lie Ala Leu Ala Tyr Val Thr Met Gly lie Asp Leu Gly Asn 
210 215 220 

Leu Ala Ala Leu Arg Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val 
225 230 235 240 

Ala lie Val Pro Gly Leu Lys Thr lie Val Gly Ala Val lie Glu Ser 

245 250 255 

Val Lys Asn Leu Arg Asp Val lie lie Leu Thr Met Phe Ser Leu Ser 
260 265 270 

Val Phe Ala Leu Met Gly Leu Gin lie Tyr Met Gly Val Leu Thr Gin 
275 280 285 

Lys Cys lie Lys Arg Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr 
290 295 300 

Asp Glu Asn Trp Phe Leu His Asn Ser Asn Ser Ser Asn Trp Phe Thr 
305 310 315 320 

Glu Asn Asp Gly Glu Ser Tyr Pro Val Cys- Gly Asn Val Ser Gly Ala 

325 330 335 

Gly Gin Cys Gly Glu Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn 
340 345 350 



Pro Asn Tyr Asp Tyr Thr Ser Phe Asp Ser Phe Gly Trp Ala Phe Leu 
355 360 365 

Ser Ala Phe Arg Leu Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin 
370 375 380 

His Val Leu Gin Ala Ala Gly Pro Trp His Met Leu Phe Phe lie Val 
385 390 395 400 

lie lie Phe Leu Gly Ser Phe Tyr Leu Val Asn Leu lie Leu Ala lie 

405 410 415 

Val Ala Met Ser Tyr Asp Glu Leu Gin Lys Lys Ala Glu Glu Glu Glu 
420 425 430 

Ala Ala Glu Glu Glu Ala lie Arg Glu Ala Glu Glu Ala Ala Ala Ala 
435 440 445 

Lys Ala Ala Lys Leu Glu Glu Arg Ala Asn Val Ala Ala Gin Ala Ala 
450 455 460 

Gin Asp Ala Ala Asp Ala Ala Ala Ala Ala Leu His Pro Glu Met Ala 
465 470 475 480 

Lys Ser Pro Thr Tyr Ser Cys lie Ser Tyr Glu Leu Phe Val Gly Gly 

485 490 495 

Glu Lys Gly Asn Asp Asp Asn Asn Lys Glu Lys Met Ser lie Arg Ser 
500 505 510 

Val Glu Val Glu Ser Glu Ser Val Ser Val lie Gin Arg Gin Pro Ala 
515 520 525 

Pro Thr Thr Ala Pro Ala Thr Lys Val Arg Lys Val Ser Thr Thr Ser 
530 535 540 

Leu Ser Leu Pro Gly Ser Pro Phe Asn Leu Arg Arg Gly Ser Arg Ser 
545 550 555 560 

Ser His Lys Tyr Thr lie Arg Asn Gly Arg Gly Arg Phe Gly lie Pro 

565 570 575 

Gly Ser Asp Arg Lys Pro Leu Val Leu Gin Thr Tyr Gin Asp Ala Gin 
580 585 590 

Gin His Leu Pro Tyr Ala Asp Asp Ser Asn Ala Val Thr Pro Met Ser 
595 600 605 - 
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Glu Glu Asn Gly Ala 
610 

Ser Arg His Ser Ser 
625 

Ser His Gly Asp Leu 

645 

Met Thr Lys Glu Ser 
660 

lie Gly Ala Ala Thr 
675 

Pro Asp Ala Asn His 
690 

Tyr Thr Asp Glu Ala 
705 

Glu Pro Val Gin Thr 

725 

Leu Asn Asp lie lie 
740 

Glu Arg Gly Glu Asp 
755 

Ala Leu Glu Tyr lie 
770 

Cys Cys Trp Val Trp 
785 

Phe Asp Pro Phe Val 

805 

Thr Met Phe Met Ala 
820 

Lys Val Leu Lys Ser 
835 

Glu Ala Ser Met Lys 
850 



lie lie Val Pro Ala Tyr 
615 

Tyr Thr Ser His Gin Ser 
630 635 

Leu Gly Gly Met Ala Ala 

650 

Lys Leu Arg Ser Arg Asn 
665 

Asn. Gly Gly Ser Ser Thr 
680 

Lys Glu Gin Arg Asp Tyr 
695 

Gly Lys lie Lys His His 
710 715 

Gin Thr Val Val Asp Met 

730 

Glu Gin Ala Ala Gly Arg 
745 

Asp Asp Glu Asp Gly Pro 
760 

Leu Lys Gly lie Glu lie 
775 

Leu Lys Phe Gin Glu Trp 
790 795 

Glu Leu Phe lie Thr Leu 

810 

Met £sp His His Asp Met 
825 

Gly Asn Tyr Phe Phe- Thr 
840 

Leu Met Ala Met Ser Pro 
855 



Tyr Cys Asn Leu Gly 
620 

Arg lie Ser Tyr Thr 

640 

Met Gly Ala Ser Thr 
655 

Thr Arg Asn Gin Ser 
670 

Ala Gly Gly Gly Tyr 
685 

Glu Met Gly Gin Asp 
700 

Asp Asn Pro Phe lie 

720 

Lys Asp Val Met Val 
735 

His Ser Arg Ala Ser 
750 

Thr Phe Lys Asp lie 
765 

Phe Cys Val Trp Asp 
780 

Val Ser Phe lie Val 

800 

Cys lie Val Val Asn 
815 

Asn Pro Glu Leu Glu 
830 

Ala Thr Phe Ala lie 
845 

Lys Tyr Tyr Phe Gin 
860 



Glu Gly Trp Asn lie Phe Asp Phe lie lie Val Ala Leu Ser Leu Leu 
865 870 875 880 

Glu Leu Gly Leu Glu Gly Val Gin Gly Leu Ser Val Leu Arg Ser Phe 

885 890 895 

Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn 
900 905 910 

Leu Leu lie Ser lie Met Gly Arg Thr Met Gly Ala Leu Gly Asn Leu 
915 920 925 

Thr Phe Val Leu Cys lie lie lie Phe lie Phe Ala Val Met Gly Met 
930 935 940 

Gin Leu Phe Gly Lys Asn Tyr lie Asp His Lys Asp Arg Phe Lys Asp 
945 950 955 960 

His Glu Leu Pro Arg Trp Asn Phe Thr Asp Phe Met His Ser Phe Met 

965 970 975 

lie Val Phe Arg Val Leu Cys Gly Glu Trp lie Glu Ser Met Trp Asp 
98 0 985 990 

Cys Met Tyr Val Gly Asp Val Ser Cys lie Pro Phe Phe Leu Ala Thr 
995 1000 1005 

Val Val lie Gly Asn Leu Val Val Leu Asn Leu Phe Leu Ala Leu Leu 
1010 1015 1020 

Leu Ser Asn Phe Gly Ser Ser Ser Leu Ser Ala Pro Thr Ala Asp Asn 
1025 1030 1035 1040 

Asp Thr Asn Lys lie Ala Glu Ala Phe Asn Arg lie Ala Arg Phe Lys 

1045 1050 1055 

Asn Trp Val Lys Arg Asn lie Ala Asp Cys Phe Lys Leu lie Arg Asn 
1060 1065 1070 

Lys Leu Thr Asn Gin lie Ser Asp Gin Pro Ser Glu His Gly Asp Asn 
1075 1080 1085 

Glu Leu Glu Leu Gly His Asp Glu lie Met Gly Asp Gly Leu lie Lys 
1090 1095 1100 

Lys Gly Met Lys Gly Glu Thr Gin Leu Glu Val Ala lie Gly Asp Gly 
1105 1110 1115 1120 
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Met Glu Phe Thr lie His Gly Asp Met Lys Asn Asn Lys Pro Lys Lys 

1125 1130 1135 

Ser Lys Phe Met Asn Asn Thr Thr Met lie Gly Asn Ser lie Asn His 
1140 1145 1150 

Gin Asp Asn Arg Leu Glu His Glu Leu Asn His Arg Gly Leu Ser lie 
1155 1160 1165 

Gin Asp Asp Asp Thr Ala Ser lie Asn Ser Tyr Gly Ser His Lys Asn 
1170 1175 1180 

Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr lie Glu 
1185 1190 1195 1200 

Gly Glu Glu Lys Arg Asp Val Ser Lys Glu Asp Leu Gly Leu Asp Glu 

1205 1210 1215 

Glu Leu Asp Glu Glu Ala Glu Gly Asp Glu Gly Gin Leu Asp Gly Asp 
1220 1225 1230 



UJ lie lie lie His Ala Gin Asn Asp Asp Glu lie lie Asp Asp Tyr Pro 

SI 1235 1240 1245 

s Ala Asp Cys Phe Pro Asp Ser Tyr Tyr Lys Lys Phe Pro lie Leu Ala 

1250 1255 1260 

Q 

Fj Gly Asp Glu Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu 

03 1265 1270 1275 1280 

r *5 Lys Thr Phe Gin Leu lie Glu Asn Lys Tyr Phe Glu Thr Ala Val lie 

w 1285 1290 1295 

Thr Met lie Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1300 1305 1310 

Leu Pro Asp Arg Pro Val Met Gin Asp lie Leu Tyr Tyr Met Asp Arg 
1315 1320 1325 

lie Phe Thr Val lie Phe Phe Leu Glu Met Leu lie Lys Trp Leu Ala 
1330 1335 1340 

Leu Gly Phe Lys Val Tyr Phe Thr Asn Ala - Trp Cys Trp Leu Asp Phe 
1345 1350 1355 1360 

Val lie Val Met Leu Ser Leu lie Asn Leu Val Ala Val Trp Ser Gly 

1365 1370 1375 
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Leu Asn Asp lie Ala Val Phe Arg Ser Met Arg Thr Leu Arg Ala Leu 
1380 1385 1390 

Arg Pro Leu Arg Ala Val Ser Arg Trp Glu Gly Met Lys Val Val Val 
1395 1400 1405 

Asn Ala Leu Val Gin Ala lie Pro Ser lie Phe Asn Val Leu Leu Val 
1410 1415 1420 

Cys Leu lie Phe Trp Leu lie Phe Ala lie Met Gly Val Gin Leu Phe 
1425 1430 1435 1440 

Ala Gly Lys Tyr Phe Lys Cys Lys Asp Gly Asn Asp Thr Val Leu Ser 

1445 1450 1455 

His Glu lie lie Pro Asn Arg Asn Ala Cys Lys Ser Glu Asn Tyr Thr 
1460 1465 1470 

Trp Glu Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1475 1480 1485 

Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp lie Gin lie Met Asn 
1490 1495 1500 

Asp Ala lie Asp Ser Arg Glu Val Asp Lys Gin Pro lie Arg Glu Thr 
1505 1510 1515 1520 

Asn lie Tyr Met Tyr Leu Tyr Phe Val Phe Phe lie lie Phe Gly Ser 

1525 1530 1535 

Phe Phe Thr Leu Asn Leu Phe lie Gly Val lie lie Asp Asn Phe Asn 
1540 1545 1550 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1555 1560 1565 

Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly Ser Lys Lys 
1570 1575 1580 

Pro Leu Lys Ala lie Pro Arg Pro Arg Trp Arg Pro Gin Ala lie Val 
1585 1590 1595 1600 

Phe Glu lie Val Thr Asp Lys Lys Phe Asp ■ lie lie lie Met Leu Phe 

1605 1610 1615 

lie Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1620 1625 1630 
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Glu Ala Tyr Asn Asn Val Leu Asp Lys Leu Asn Gly lie Phe Val Val 
1635 1640 1645 

lie Phe. Ser Gly Glu Cys Leu Leu Lys lie Phe Ala Leu Arg Tyr His 
1650 1655 1660 

Tyr Phe Lys Glu Pro Trp Asn Leu Phe Asp Val Val Val Val lie Leu 
1665 1670 1675 1680 

Ser lie Leu Gly Leu Val Leu Ser Asp lie lie Glu Lys Tyr Phe Val 

1685 1690 1695 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
1700 1705 1710 

Leu Arg Leu Val Lys Gly Ala Lys Gly lie Arg Thr Leu Leu Phe Ala 
Q 1715 1720 1725 

JI Leu Ala Met Ser Leu Pro Ala Leu Phe Asn lie Cys Leu Leu Leu Phe 
nj 1730 1735 1740 

hj Leu Val Met Phe lie Phe Ala lie Phe Gly Met Ser Phe Phe Met His 

%j 1745 1750 1755 1760 

/ Val Lys Glu Lys Ser Gly lie Asn Ala Val Tyr Asn Phe Lys Thr Phe 

1765 1770 1775 

51 Gly Gin Ser Met lie Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp 

1780 1785 1790 

Asp Gly Val Leu Asp Ala lie lie Asn Glu Glu Asp Cys Asp Pro Pro 
^ 1795 1800 1805 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1810 1815 1820 

lie Thr Phe Leu Leu Ser Tyr Leu Val lie Ser Phe Leu lie Val lie 
1825 1830 1835 1840 

Asn Met Tyr lie Ala Val lie Leu Glu Asn Tyr Ser Gin Ala Thr Glu 

1845 1850 1855 

Asp Val Gin Glu Gly Leu Thr Asp Asp Asp - Tyr Asp Met Tyr Tyr Glu 
1860 1865 1870 



lie Trp Gin Gin Phe Asp Pro Glu Gly Thr Gin Tyr lie Arg Tyr Asp 
1875 1880' 1885 




Gin Leu Ser Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin lie His 
1890 1895 1900 

Lys Pro Asn Lys Tyr Lys lie lie Ser Met Asp Met Pro lie Cys Arg 
1905 1910 1915 1920 

Gly Asp Met Met Tyr Cys Val Asp lie Leu Asp Ala Leu Thr Lys Asp 

1925 1930 1935 

Phe Phe Ala Arg Lys Gly Asn Pro lie Glu Glu Thr Gly Glu lie Gly 
1940 1945 1950 

Glu lie Ala Ala Arg Pro Asp Thr Glu Gly Tyr Asp Pro Val Ser Ser 
1955 1960 1965 

Thr Leu Trp Arg Gin Arg Glu Glu Tyr Cys Ala Lys Leu lie Gin Asn 
1970 1975 1980 

Ala Trp Arg Arg Tyr Lys Asn Gly Pro Pro Gin Glu Gly Asp Glu Gly 
1985 1990 1995 2000 

Glu Ala Ala Gly Gly Glu Asp Gly Ala Glu Gly Gly Glu Gly Glu Gly 

2005 2010 2015 

Gly Ser Gly Gly Gly Gly Gly Asp Asp Gly Gly Ser Ala Thr Gly Ala 
2020 2025 2030 

Thr Ala Ala Ala Gly Ala Thr Ser Pro Ser Asp Pro Asp Ala^Gly Glu 
2035 2040 2045 

Ala Asp Gly Ala Ser Val Gly Gly Pro Leu Ser Pro Gly Cys Val Ser 
2050 2055 2060 

Gly Gly Ser Asn Gly Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly 
2065 2070 2075 2080 

Phe Val Thr Lys Asn Gly His Lys Val Val lie His Ser Arg Ser Pro 

2085 2090 2095 

Ser lie Thr Ser Arg Thr Ala Asp Val 
2100 2105 



(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
15 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Leu Gin lie Glu Gin Arg lie Ala 
20 25 30 

n 

Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Ala Glu Gly 
35 40 45 

Glu Gin lie Arg Tyr Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro 
50 55 60 

Asp Pro Thr Leu Glu Gin Gly Val Pro lie Pro Val Arg Met Gin Gly 
65 70 75 80 

Ser Phe Pro Pro Glu Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro 

85 90 95 

Phe Tyr Ser Asn Val Leu Thr Phe Val Val lie Ser Lys Gly Lys Asp 
100 105 110 

lie Phe Arg Phe Ser Ala Ser Lys Ala Met Trp Leu Leu Asp Pro Phe 
115 120 125 

Asn Pro lie Arg Arg Val Ala lie Tyr lie Leu Val His Pro Leu Phe 
130 135 140 

Ser Leu Phe lie lie Thr Thr lie Leu Thr Asn Cys lie Leu Met lie 
145 150 155 160 

Met Pro Thr Thr Pro Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly 

165 170 175 

lie Tyr Thr Phe Glu Ser Ala Val Lys Val • Met Ala Arg Gly Phe lie 
180 185 190 

Leu Cys Pro Phe Thr Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe 
195 200 ' 205 



Val Val lie Ala Leu Ala Tyr Val Thr Met Gly lie Asp Leu Gly Asn 
210 215 220 

Leu Ala Ala Leu Arg Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val 
225 230 235 240 

Ala lie Val Pro Gly Leu Lys Thr lie Val Gly Ala Val lie Glu Ser 

245 250 255 

Val Lys Asn Leu Arg Asp Val lie lie Leu Thr Met Phe Ser Leu Ser 
260 265 270 

Val Phe Ala Leu Met Gly Leu Gin lie Tyr Met Gly Val Leu Thr Gin 
275 280 285 

Lys Cys lie Lys Arg Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr 
290 295 300 

Asp Glu Asn Trp Phe Leu His Asn Ser Asn Ser Ser Asn Trp Phe Thr 
305 310 315 320 

Glu Asn Asp Gly Glu Ser Tyr Pro Val Cys Gly Asn Val Ser Gly Ala 

325 330 335 

Gly Gin Cys Gly Glu Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn 
340 345 350 

Pro Asn Tyr Asp Tyr Thr Ser Phe Asp Ser Phe Gly Trp Ala Phe Leu 
355 360 365 

Ser Ala Phe Arg Leu Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin 
370 375 380 

His Val Leu Gin Ala Ala Gly Pro Trp His Met Leu Phe Phe lie Val 
385 390 395 400 

lie lie Phe Leu Gly Ser Phe Tyr Leu Val Asn Leu lie Leu Ala lie 

405 410 415 

Val Ala Met Ser Tyr Asp Glu Leu Gin Lys Lys Ala Glu Glu Glu Glu 
420 425 430 

Ala Ala Glu Glu Glu Ala lie Arg Glu Ala Glu Glu Ala Ala Ala Ala 
435 440 445 

Lys Ala Ala Lys Leu Glu Glu Arg Ala Asn Val Ala Ala Gin Ala Ala 
450 455 460 
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Gin Asp Ala Ala Asp Ala Ala Ala Ala Ala Leu His Pro Glu Met Ala 
465 470 475 480 

Lys Ser Pro Thr Tyr Ser Cys lie Ser Tyr Glu Leu Phe Val Gly Gly 

485 490 495 

Glu Lys Gly Asn Asp Asp Asn Asn Lys Glu Lys Met Ser lie Arg Ser 
500 505 510 

Val Glu Val Glu Ser Glu Ser Val Ser Val lie Gin Arg Gin Pro Ala 
515 520 525 

Pro Thr Thr Ala Pro Ala Thr Lys Val Arg Lys Val Ser Thr Thr Ser 
530 535 540 

Leu Ser Leu Pro Gly Ser Pro Phe Asn Leu Arg Arg Gly Ser Arg Ser 
545 550 555 560 

Ser His Lys Tyr Thr lie Arg Asn Gly Arg Gly Arg Phe Gly lie Pro 

565 570 575 

Gly Ser Asp Arg Lys Pro Leu Val Leu Gin Thr Tyr Gin Asp Ala Gin 
580 585 590 

Gin His Leu Pro Tyr Ala Asp Asp Ser Asn Ala Val Thr Pro Met Ser 
595 600 605 

Glu Glu Asn Gly Ala lie lie Val Pro Ala Tyr Tyr Cys Asn Leu Gly 
610 615 620 

Ser Arg His Ser Ser Tyr Thr Ser His Gin Ser Arg lie Ser Tyr Thr 
625 630 635 640 

Ser His Gly Asp Leu Leu Gly Gly Met Ala Ala Met Gly Ala Ser Thr 

645 650 655 

Met Thr Lys Glu Ser Lys Leu Arg Ser Arg Asn Thr Arg Asn Gin Ser 
660 665 670 

lie Gly Ala Ala Thr Asn Gly Gly Ser Ser Thr Ala Gly Gly Gly Tyr 
675 680 685 

Pro Asp Ala Asn His Lys Glu Gin Arg Asp- Tyr Glu Met Gly Gin Asp 
690 695 700 

Tyr Thr Asp Glu Ala Gly Lys lie -Lys His His Asp Asn Pro Phe lie 
705 710 715 720 



( 
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Glu Pro Val Gin Thr Gin Thr Val Val Asp Met Lys Asp Val Met Val 

725 730 735 

Leu Asn Asp lie lie Glu Gin Ala Ala Gly Arg His Ser Arg Ala Ser 
740 745 750 

Glu Arg Gly Glu Asp Asp Asp Glu Asp Gly Pro Thr Phe' Lys Asp lie 
755 760 765 

Ala Leu Glu Tyr lie Leu Lys Gly lie Glu lie Phe Cys Val Trp Asp 
770 775 780 

Cys Cys Trp Val Trp Leu Lys Phe Gin Glu Trp Val Ser Phe lie Val 
785 790 795 800 

Phe Asp Pro Phe Val Glu Leu Phe lie Thr Leu Cys lie Val Val Asn 

805 810 315 

Thr Met Phe Met Ala Met Asp His His Asp Met Asn Pro Glu Leu Glu 
820 825 830 

Lys Val Leu Lys Ser Gly Asn Tyr Phe Phe Thr Ala Thr Phe Ala lie 
835 840 845 . 

Glu Ala Ser Met Lys Leu Met Ala Met Ser Pro Lys Tyr Tyr Phe Gin 
850 855 860 

Glu Gly Trp Asn lie Phe Asp Phe lie lie Val Ala Leu Ser Leu Leu 
865 870 875 880 

Glu Leu Gly Leu Glu Gly Val Gin Gly Leu Ser Val Leu Arg Ser Phe 

885 890 895 

Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn 
900 905 910 

Leu Leu lie Ser lie Met Gly Arg Thr Met Gly Ala Leu Gly Asn Leu 
915 920 925 

Thr Phe Val Leu Cys lie lie lie Phe lie Phe Ala Val Met Gly Met 
930 935 940 

Gin Leu Phe Gly Lys Asn Tyr lie Asp His Lys Asp Arg Phe Lys Asp 
945 950 955 960 

His Glu Leu Pro Arg Trp Asn Phe Thr Asp Phe Met His Ser Phe Met 

965 970 975 



lie Val Phe Arg Val Leu Cys Gly Glu Trp lie Glu Ser Met Trp Asp 
980 985 990 

Cys Met Tyr Val Gly Asp Val Ser Cys lie Pro Phe Phe Leu Ala Thr 
995 1000 1005 

Val Val lie Gly Asn Phe Val Val Leu Asn Leu Phe Leu Ala Leu Leu 
1010 1015 1020 

Leu Ser Asn Phe Gly Ser Ser Ser Leu Ser Ala Pro Thr Ala Asp Asn 
1025 1030 1035 1040 

Asp Thr Asn Lys lie Ala Glu Ala Phe Asn Arg lie Ala Arg Phe Lys 

1045 1050 1055 

Asn Trp Val Lys Arg Asn lie Ala Asp Cys Phe Lys Leu lie Arg Asn 
1060 1065 1070 

Lys Leu Thr Asn Gin lie Ser Asp Gin Pro Ser Glu His Gly Asp Asn 
1075 1080 1085 

Glu Leu Glu Leu Gly His Asp Glu lie Met Gly Asp Gly Leu lie Lys 
1090 1095 1100 

Lys Gly Met Lys Gly Glu Thr Gin Leu Glu Val Ala lie Gly Asp Gly 
1105 1110 1115 1120 

Met Glu Phe Thr lie His Gly Asp Met Lys Asn Asn Lys Pro Lys Lys 

1125 1130 1135 

Ser Lys Phe lie Asn Asn Thr Thr Met lie Gly Asn Ser lie Asn His 
1140 1145 1150 

Gin Asp Asn Arg Leu Glu His Glu Leu Asn His Arg Gly Leu Ser lie 
1155 1160 1165 

Gin Asp Asp Asp Thr Ala Ser lie Asn Ser Tyr Gly Ser His Lys Asn 
1170 1175 1180 

Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr lie Glu 
1185 1190 1195 1200 

Gly Glu Glu Lys Arg Asp Val Ser Lys Glu Asp Leu Gly Leu Asp Glu 

1205 1210 1215 

Glu Leu Asp Glu Glu Ala Glu Gly Asp Glu Gly Gin Leu Asp Gly Asp 
1220 1225 1230 
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lie lie lie His Ala Gin Asn Asp Asp Glu lie lie Asp Asp Tyr Pro 
1235 1240 1245 

Ala Asp Cys Phe Pro Asp Ser Tyr Tyr Lys Lys Phe Pro lie Leu Ala 
1250 1255 1260 

Gly Asp Glu Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu 
1265 1270 1275 1280 

Lys Thr Phe Gin Leu lie Glu Asn Lys Tyr Phe Glu Thr Ala Val lie 

1285 1290 1295 

Thr Met lie Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1300 1305 1310 

Leu Pro Asp Arg Pro Val Met Gin Asp lie Leu Tyr Tyr Met Asp Arg 
1315 1320 1325 

lie Phe Thr Val lie Phe Phe Leu Glu Met Leu lie Lys Trp Leu Ala 
1330 1335 1340 

Leu Gly Phe Lys Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp Phe 
1345 1350 " 1355 1360 

Val lie Val Met Leu Ser Leu lie Asn Leu Val Ala Val Trp Ser Gly 

1365 1370 1375 

Leu Asn Asp lie Ala Val Phe Arg Ser Met Arg Thr Leu Arg Ala Leu 
1380 1385 1390 

Arg Pro Leu Arg Ala Val Ser Arg Trp Glu Gly Met Lys Val Val Val 
1395 1400 1405 

Asn Ala Leu Val Gin Ala lie Pro Ser lie Phe Asn Val Leu Leu Val 
1410 1415 1420 

Cys Leu lie Phe Trp Leu lie Phe Ala lie Met Gly Val Gin Leu Phe 
1425 1430 1435 1440 

Ala Gly Lys Tyr Phe Lys Cys Lys Asp Gly Asn Asp Thr Val Leu Ser 

1445 1450 1455 

His Glu lie lie Pro Asn Arg Asn Ala Cys Lys Ser Glu Asn Tyr Thr 
1460 1465 1470 

Trp Glu Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1475 1480 1485 



Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp lie Gin lie Met Asn 
1490 1495 1500 

Asp Ala lie Asp Ser Arg Glu Val Asp Lys Gin Pro lie Arg Glu Thr 
1505 1510 1515 1520 

Asn lie Tyr Met Tyr Leu Tyr Phe Val Phe Phe lie lie Phe Gly Ser 

1525 1530 1535 

Phe Phe Thr Leu Asn Leu Phe lie Gly Val lie lie Asp Asn Phe Asn 
1540 1545 1550 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1555 1560 1565 

Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly Ser Lys Lys 
1570. 1575 1580 

Pro Leu Lys Ala lie Pro Arg Pro Arg Trp Arg Pro Gin Ala lie Val 
1585 1590 1595 1600 

Phe Glu lie Val Thr Asp Lys Lys Phe Asp lie lie lie Met Leu Phe 

1605 1610 1615 

lie Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1620 1625 1630 

Glu Ala Tyr Asn Asn Val Leu Asp Lys Leu Asn Gly lie Phe Val Val 
1635 1640 1645 

lie Phe Ser Gly Glu Cys Leu Leu Lys lie Phe Ala Leu Arg Tyr His 
1650 1655 1660 

Tyr Phe Lys Glu Pro Trp Asn Leu Phe Asp Val Val Val Val lie Leu 
1665 1670 1675 1680 

Ser lie Leu Gly Leu Val Leu Ser Asp lie lie Glu Lys Tyr Phe Val 

1685 1690 1695 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
1700 1705 1710 

Leu Arg Leu Val Lys Gly Ala Lys Gly lie Arg Thr Leu Leu Phe Ala 
1715 1720 1725 

Leu Ala Met Ser Leu Pro Ala Leu Phe Asn lie Cys Leu Leu Leu Phe 
1730 1735 ' 1740 
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Leu Val Met Phe lie Phe Ala lie Phe Gly Met Ser Phe Phe Met His 
1745 1750 1755 1760 

Val Lys Glu Lys Ser Gly lie Asn Ala Val Tyr Asn Phe Lys Thr Phe 

1765 1770 1775 

Gly Gin Ser Met lie Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp 
1780 1785 1790 

Asp Gly Val Leu Asp Ala lie lie Asn Glu Glu Asp Cys Asp Pro Pro 
1795 1800 1805 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1810 1815 1820 

lie Thr Phe Leu Leu Ser Tyr Leu Val lie Ser Phe Leu lie Val lie 
1825 1830 1835 1840 

Asn Met Tyr lie Ala Val lie Leu Glu Asn Tyr Ser Gin Ala Thr Glu 

1845 1850 1855 

Asp Val Gin Glu Gly Leu Thr Asp Asp Asp Tyr Asp Met Tyr Tyr Glu 
1860 1865 1870 

lie Trp Gin Gin Phe Asp Pro Glu Gly Thr Gin Tyr lie Arg Tyr Asp 
% 1875 1880 1885 

Gin Leu Ser Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin lie His 
1890 1895 1900 

Lys Pro Asn Lys Tyr Lys lie lie Ser Met Asp Met Pro lie Cys Arg 
1905 1910 1915 1920 

Gly Asp Met Met Tyr Cys Val Asp lie Leu Asp Ala Leu Thr Lys Asp 

1925 1930 1935 

Phe Phe Ala Arg Lys Gly Asn Pro lie Glu Glu Thr Gly Glu lie Gly 
1940 1945 1950 

Glu lie Ala Ala Arg Pro Asp Thr Glu Gly Tyr Asp Pro Val Ser Ser 
1955 1960 1965 

Thr Leu Trp Arg Gin Arg Glu Glu Tyr Cys Ala Lys Leu lie Gin Asn 
1970 1975 1980 

Ala Trp Arg Arg Tyr Lys Asn Gly Pro Pro Gin Glu Gly Asp Glu Gly 
1985 1990 1995 2000 



i 
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Glu Ala Ala Gly Gly Glu Asp Gly Ala Glu Gly Gly Glu Gly Glu Gly 

2005 2010 2015 

Gly Ser Gly Gly Gly Gly Asp Asp Asp Gly Gly Ser Ala Thr Ala Ala 
2020 2025 2030 

Gly Ala Thr Ser Pro Thr Asp Pro Asp Ala Gly Glu Ala Asp Gly Ala 
2035 2040 2045 

Ser Ala Gly Asn Gly Gly Gly Pro Leu Ser Pro Gly Cys Val Ser Gly 
2050 2055 2060 

Gly Ser Asn Gly Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly Phe 
2065 2070 2075 2080 

Val Thr Lys Asn Gly His Lys Val Val lie His Ser Arg Ser Pro Ser 

2085 2090 2095 

^ lie Thr Ser Arg Thr Ala Asp Val 

% 2100 

0?(2) INFORMATION FOR SEQ ID NO : 5 : 

03 

W (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 
H (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
H= (D) TOPOLOGY: linear 

Q 

nj (ii) MOLECULE TYPE: cDNA 



J (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CGGTTGGGCT TTCCTGTC 18 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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GGGAATTCRA ADATRTTCCA NCCYTC 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
CCCGARGAYA THGAYCYNTA YTA 



~^2) INFORMATION FOR SEQ ID NO : 8 : 

jf (i) SEQUENCE CHARACTERISTICS: 

il (A) LENGTH: 18 base pairs 

vf (B) TYPE: nucleic acid 

fH (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: cDNA 

O (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 

ru 

Q|:gtatcgcct cctcctcg 



(2) INFORMATION FOR SEQ ID NO : 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
GGGTCTAGAT HTTYGCNATH TTYGGNATG 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGGGAATTCN GGRTCRAAYT GYTGCCA 2 7 

(2) INFORMATION FOR SEQ ID NO: 11: 

% (i) SEQUENCE CHARACTERISTICS: 
'% (A) LENGTH: 27 base pairs 

21 (B) TYPE: nucleic acid 

% (C) STRANDEDNESS: single 

[7: (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

rU GGGT CT AG AR GANCARAARA ARTAYTA 2 7 

U3(2) INFORMATION FOR SEQ ID NO: 12: 

'J3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
TCATACTTTG GCCCAATGTC 2 0 

(2) INFORMATION FOR SEQ ID NO:13: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCCGAATTAG AGAAGGTGCT G 
(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

« (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



rn 



(ii) MOLECULE TYPE: cDNA 



N (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 
-ACTATTGCTT GTGGTCGCCA C 

l ; 
P= 

Q2) INFORMATION FOR SEQ ID NO: 15: 

fO (i) SEQUENCE CHARACTERISTICS: 

*S (A) LENGTH: 25 base pairs 

yi (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CATCNTTRGC NGCNTAGACN ATGAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GATTGAATGG ATCGAGCAGC C 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
I (D) TOPOLOGY: linear 

I (ii) MOLECULE TYPE: cDNA 



y (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

MCGTTTCTCCT TTCATATCTA G 

L (2) INFORMATION FOR SEQ ID NO: 18: 

Si (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 
^ (B) TYPE: nucleic acid 

1* (C) STRANDEDNESS: single 

m (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GGAGBGGBGG NCKBGGNCKN GCTCA 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
15 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Val Gin lie Glu Gin Arg lie Ala 
20 25 30 

Ala Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Glu Gly 
35 40 45 

Glu Val Pro Arg Tyr Gly Arg Lys Lys Lys Gin Lys Glu lie Arg Tyr 
50 55 60 

Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro Asp Pro Thr Leu Glu 
65 70 75 80 

Gin Gly Val Pro lie Pro Val Arg Leu Gin Gly Ser Phe Pro Pro Glu 

85 90 95 

Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro Tyr Tyr Ser Asn Val 
100 105 110 

Leu Thr Phe Val Val Val Ser Lys Gly Lys Asp lie Phe Arg Phe Ser 
115 120 125 

Ala Ser Lys Ala Met Trp Met Leu Asp Pro Phe Asn Pro lie Arg Arg 
130 135 - 140 

Val Ala lie Tyr lie Leu Val His Pro Leu Phe Ser Leu Phe lie lie 
145 150 155 160 

Thr Thr lie Leu Val Asn Cys lie Leu Met lie Met Pro Thr Thr Pro 

165 170 175 

Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly lie Tyr Thr Phe Glu 
180 185 190 

Ser Ala Val Lys Val Met Ala Arg Gly Phe lie Leu Cys Pro Phe Thr 
195 200 205 

Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe Val Val lie Ala Leu 
210 215 220 

Ala Tyr Val Thr Met Gly lie Asp Leu Gly Asn Leu Ala Ala Leu Arg 
225 230 235 240 



Thr Phe Arg Val 



Leu Lys Thr lie 
260 



Asp Val He He 
275 

Gly Leu Gin He 
290 



Leu Arg Ala Leu 
245 

Val Gly Ala Val 



Leu Thr Met Phe 
280 

Tyr Met Gly Val 
295 



Lys Thr Val Ala 
250 

He Glu Ser Val 
265 

Ser Leu Ser Val 



Leu Thr Glu Lys 
300 



He Val Pro Gly 
255 

Lys Asn Leu Arg 
270 

Phe Ala Leu Met 
285 

Cys lie Lys Lys 



Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr Asp Glu Asn Trp Asp 
305 310 315 320 

Tvr His Asn Arg Asn Ser Ser Asn Trp Tyr Ser Glu Asp Glu Gly He 
Y 325 330 335 

Ser Phe Pro Leu Cys Gly Asn He Ser Gly Ala Gly Gin Cys Asp Asp 
340 345 350 

Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn Pro Asn Tyr Gly Tyr 
355 360 365 

Thr Ser Phe Asp Ser Phe Gly Trp Ala Phe Leu Ser Ala Phe Arg Leu 
370 375 380 

Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin Leu Val Leu Arg Ala 
385 390 395 400 

Ala Gly Pro Trp His Met Leu Phe Phe He Val He He Phe Leu Gly 

405 410 415 

Ser Phe Tyr Leu Val Asn Leu He Leu Ala He Val Ala Met Ser Tyr 
420 425 430 

Asp Glu Leu Gin Arg Lys Ala Glu Glu Glu Glu Ala Ala Glu Glu Glu 
435 440 445 

Ala He Arg Glu Ala Glu Glu Ala Ala Ala Ala Lys Ala Ala Lys Leu 
450 455 460 

Glu Glu Arq Ala Asn Ala Gin Ala. Gin Ala Ala Ala Asp Ala Ala Ala 
465 470 475 480 

Ala Glu Glu Ala Ala Leu His Pro Glu Met Ala Lys Ser Pro Thr Tyr 



485 490 495 

Ser Cys lie Ser Tyr Glu Leu Phe Val Gly Gly Glu Lys Gly Asn Asp 
500 505 510 

Asp Asn Asn Lys Glu Lys Met Ser lie Arg Ser Val Glu Val Glu Ser 
515 520 525 

Glu Ser Val Ser Val lie Gin Arg Gin Pro Ala Pro Thr Thr Ala His 
530 535 540 

Gin Ala Thr Lys Val Arg Lys Val Ser Thr Thr Ser Leu Ser Leu Pro 
545 550 555 560 

Gly Ser Pro Phe Asn lie Arg Arg Gly Ser Arg Ser Ser His Lys Tyr 

565 570 575 

Thr He Arg Asn Gly Arg Gly Arg Phe Gly He Pro Gly Ser Asp Arg 
580 585 590 

Lys Pro Leu Val Leu Ser Thr Tyr Gin Asp Ala Gin Gin His Leu Pro 
595 600 605 

Tyr Ala Asp Asp Ser Asn Ala Val Thr Pro Met Ser Glu Glu Asn Gly 
610 615 620 

Ala He He Val Pro Val Tyr Tyr Gly Asn Leu Gly Ser Arg His Ser 
625 630 635 640 

Ser Tyr Thr Ser His Gin Ser Arg He Ser Tyr Thr Ser His Gly Asp 

645 650 655 

Leu Leu Gly Gly Met Ala Val Met Gly Val Ser Thr Met Thr Lys Glu 
660 665 670 



Ser Lys Leu Arg 
675 

Asn Gly Gly Thr 
690 

Asp Tyr Glu He 
705 

His His Asp Asn 



Asn Arg Asn Thr 
680 

Thr Cys Leu Asp 
695 

Gly Leu Glu Cys 
710 

Pro Phe He Glu 
725 



Arg Asn Gin Ser 



Thr Asn His Lys 
700 

Thr Asp Glu Ala 
715. 

Pro Val Gin Thr 
730 



Val Gly Ala Thr 
685 

Leu Asp His Arg 



Gly Lys He Lys 
720 

Gin Thr Val Val 
735 
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Asp Met Lys Asp 
740 



Gly Arg His Ser 
755 

Gly Pro Thr Phe 
770 

Asp Val Phe Cys 
785 

Glu Trp Val Ser 



Thr Leu Cys lie 
820 



Asp Met Asn Lys 
835 

Phe Thr Ala Thr 
850 

Ser Pro Lys Tyr 
865 

lie Val Ala Leu 



Leu Ser Val Leu 
900 



Lys Ser Trp Pro 
915 

Met Gly Ala Leu 
930 

lie Phe Ala Val 
945 



His Lys Asp Arg 



Asp Phe Met His 
980 



Val Met Val Leu 



Arg Ala Ser Asp 
760 



Lys Asp Lys Ala 
775 

Val Trp Asp Cys 
790 

Leu lie Val Phe 
805 

Val Val Asn Thr 



Glu Met Glu Arg 
840 

Phe Ala lie Glu 
855 

Tyr Phe Gin Glu 
870 

Ser Leu Leu Glu 
885 

Arg Ser Phe Arg 



Thr Leu Asn Leu 
920 

Gly Asn Leu Thr 
935 

Met Gly Met Gin 
950 



Phe Pro Asp Gly 
965 

Ser Phe Met lie 



Asn Asp lie lie 
745 

Arg Gly Glu Asp 



Leu Glu Val lie 
780 

Cys Trp Val Trp 
795 

Asp Pro Phe Val 
810 

Met Phe Met Ala 
825 

Val Leu Lys Ser 



Ala Thr Met Lys 
860 

Gly Trp Asn lie 
875 

Leu Gly Leu Glu 
890 

Leu Leu Arg Val 
905 

Leu lie Ser lie 



Phe Val Leu Cys 
940 

Leu Phe Gly Lys 
955 



Asp Leu Pro Arg 
970 

Val Phe Arg Val 
985 



Glu Gin Ala Ala 
750 

Asp Asp Glu Asp 
765 

Leu Lys Gly lie 



Leu Lys Phe Gin 
800 

Glu Leu Phe lie 
815 

Met Asp His His 
830 

Gly Asn Tyr Phe 
845 

Leu Met Ala Met 



Phe Asp Phe lie 
880 

Gly Val Gin Gly 
895 

Phe Lys Leu Ala 
910 

Met Gly Arg Thr 
925 

lie lie lie Phe 



Asn Tyr His Asp 
960 



Trp Asn Phe Thr 
975 

Leu Cys Gly Glu 

- 990 



f 
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Trp He Glu Ser Met Trp Asp Cys Met Tyr Val Gly Asp Val Ser Cys 
995 1000 1005 

He Pro Phe Phe Leu Ala Thr Val Val He Gly Asn Leu Val Val Leu 
10 io 1015 1020 

Asn Leu Phe Leu Ala Leu Leu Leu Ser Asn Phe Gly Ser Ser Ser Leu 
1025 1030 1035 1040 

Ser Ala Pro Thr Ala Asp Asn Asp Thr Asn Lys He Ala Glu Ala Phe 

10 45 1050 1055 

Asn Arq He Gly Arg Phe Lys Ser Trp Val Lys Arg Asn He Ala Asp 
1060 1065 1070 

Cvs Phe Lys Leu He Arg Asn Lys Leu Thr Asn Gin He Ser Asp Gin 
1 10 75 1° 80 1085 

Pro Ser Glu His Gly Asp Asn Glu Leu Glu Leu Gly His Asp Glu He 
10 90 1095 HOO 

Leu Ala Asp Gly Leu He Lys Lys Gly He Lys Glu Gin Thr Gin Leu 
1105 IHO HI 5 1120 

Glu Val Ala He Gly Asp Gly Met Glu Phe Thr He His Gly Asp Met 

1125 1130 1135 

Lvs Asn Asn Lys Pro Lys Lys Ser Lys Tyr Leu Asn Asn Ala Thr Asp 
1140 1145 1150 

Asp Asp Thr Ala Ser He Asn Ser Tyr Gly Ser His Lys Asn Arg Pro 
H55 1160 1165 

Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr Met Glu Gly Glu 
livO 1175 1180 

Glu Lys Arg Asp Ala Ser Lys Glu Asp Leu Gly Leu Asp Glu Glu Leu 
1185 1190 1195 1200 

Asn Glu Glu Gly Glu Cys Glu Glu Gly Pro Leu Asp Gly Asp He He 

1205 1210 1215 

He His Ala His Asp Glu Asp He Leu Asp Glu Tyr Pro Ala Asp Cys 
1220 1225 1230 

Cys Pro Asp Ser Tyr Tyr Lys Lys Phe Pro He Leu Ala Gly Asp Asp 
1235 1240 1245 
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Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu Lys Thr Phe 
1250 1255 1260 

Arg Leu lie Glu Asp Lys Tyr Phe Glu Thr Ala Val lie Thr Met lie 
1265 1270 1275 1280 

Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His Leu Pro Gin 

1285 1290 1295 

Arg Pro lie Leu Gin Asp lie Leu Tyr Tyr Met Asp Arg lie Phe Thr 
1300 1305 1310 

Val lie Phe Phe Leu Glu Met Leu lie Lys Trp Leu Ala Leu Gly Phe 
1315 1320 1325 

Lys Val Tyr Leu Thr Asn Ala Trp Cys Trp Leu Asp Phe Val lie Val 
1330 1335 1340 

Met Val Ser Leu lie Asn Phe Val Ala Ser Leu Val Gly Ala Gly Gly 
1345 1350 1355 1360 

lie Gin Ala Phe Lys Thr Met Arg Thr Leu Arg Ala Leu Arg Pro Leu 

1365 1370 1375 

Arg Ala Met Ser Arg Met Gin Gly Met Arg Val Val Val Asn Ala Leu 
1380 1385 1390 

Val Gin Ala lie Pro Ser lie Phe Asn Val Leu Leu Val Cys Leu lie 
1395 1400 1405 

Phe Trp Leu lie Phe Ala lie Met Gly Val Gin Leu Phe Ala Gly Lys 
1410 1415 1420 

Tyr Phe Lys Cys Glu Asp Met Asn Gly Thr Lys Leu Ser His Glu lie 
1425 1430 1435 1440 

lie Pro Asn Arg Asn Ala Cys Glu Ser Glu Asn Tyr Thr Trp Val Asn 

1445 1450 1455 

Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu Cys Leu Phe 
1460 1465 1470 

Gin Val Ala Thr Phe Lys Gly Trp lie Gin lie Met Asn Asp Ala lie 
1475 1480 1485 

Asp Ser Arg Glu Val Asp Lys Gin Pro He Arg Glu Thr Asn He Tyr 
1490 1495 1500 



Met Tyr Leu Tyr 
1505 

Leu Asn Leu Phe 



Phe Val Phe Phe 
1510 

lie Gly Val He 
1525 



He He Phe Gly 
1515 

He Asp Asn Phe 
1530 



Ser Phe Phe Thr 
1520 

Asn Glu Gin Lys 
1535 



Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu Asp Gin Lys 
1540 1545 1550 

Lys Tyr Tyr Ser Ala Met Lys Lys Met Gly Ser Lys Lys Pro Leu Lys 
1555 1560 1565 

Ala He Pro Arg Pro Arg Trp Arg Pro Gin Ala He Val Phe Glu He 
1570 1575 1580 

Val Thr Asp Lys Lys Phe Asp He He He Met Leu Phe He Gly Leu 
1585 1590 1595 1600 

Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser Asp Thr Tyr 

1605 1610 1615 

Asn Ala Val Leu Asp Tyr Leu Asn Ala He Phe Val Val He Phe Ser 
1620 1625 1630 

Ser Glu Cys Leu Leu Lys He Phe Ala Leu Arg Tyr His Tyr Phe He 
1635 1640 1645 

Glu Pro Trp Asn Leu Phe Asp Val Val Val Val He Leu Ser He Leu 
1650 1655 1660 

Gly Leu Val Leu Ser Asp He He Glu Lys Tyr Phe Val Ser Pro Thr 
1665 1670 1675 1680 

Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val Leu Arg Leu 

1685 1690 1695 

Val Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe Ala Leu Ala Met 
1700 1705 1710 

Ser Leu Pro Ala Leu Phe Asn He Cys Leu Leu Leu Phe Leu Val Met 
1715 1720 1725 

Phe lie Phe Ala He Phe Gly Met Ser Phe Phe Met His Val Lys Glu 
1730 1735 . 1740 

Lys Ser Gly He Asn Asp Val Tyr Asn Phe Lys Thr Phe Gly Gin Ser 
1745 1750 1755 1760 



a 
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Met He Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp Asp Gly Val 

1765 1770 1775 

Leu Asd Ala He He Asn Glu Glu Ala Cys Asp Pro Pro Asp Asn Asp 
1780 1785 1790 

Lvs Glv Tvr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly He Thr Phe 
1 1795 1800 1805 

Leu Leu Ser Tyr Leu Val He Ser Phe Leu He Val He Asn Met Tyr . 
1810 1815 1820 

He Ala Val He Leu Glu Asn Tyr Ser Gin Ala Thr Glu Asp Val Gin 
1825 1830 1835 1840 

Glu Gly Leu Thr Asp Asp Asp Tyr Asp Met Tyr Tyr Glu He Trp Gin 

1845 1850 1855 

Gin Phe Asp Pro Glu Gly Thr Gin Tyr He Arg Tyr Asp Gin Leu Ser 
I860 1865 1870 

Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin He His Lys Pro Asn 
1875 1880 1885 

Lvs Tvr Lys He He Ser Met Asp He Pro He Cys Arg Gly Asp Leu 
1890 1895 1900 

Met Tyr Cys Val Asp He Leu Asp Ala Leu Thr Lys Asp Phe Phe Ala 
1905 1910 1915 1920 

Ara Lvs Gly Asn Pro lie Glu Glu Thr Gly Glu He Gly Glu He Ala 
a 1925 1930 1935 

Ala Ara Pro Asp Thr Glu Gly Tyr Glu Pro Val Ser Ser Thr Leu Trp 
1940 1945 1950 

Ara Gin Arg Glu Glu Tyr Cys Ala Arg Leu He Gin His Ala Trp Arg 
S 1955 1960 1965 

Lvs His Lys Ala Arg Gly Glu Gly Gly Gly Ser Phe Glu Pro Asp Thr 
1970 1975 1980 

As n His Gly Asp Gly Gly Asp Pro Asp Ala Gly Asp Pro Ala Pro Asp 
1985 1990 . 1995 2000 

Glu Ala Thr Asp Gly Asp Ala Pro Ala Gly Gly Asp Gly Ser Val Asn 

2005 2010 2015 
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Gly Thr Ala Glu Gly Ala Ala Asp Ala Asp Glu Ser Asn Val Asn Ser 
2020 2025 2030 

Pro Gly Glu Asp Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
2035 2040 2045 

Ala Ala Gly Thr Thr Thr Ala Gly Ser Pro Gly Ala Gly Ser Ala Gly 
2050 2055 2060 

Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly Phe Val Thr Lys Asn 
2065 2070 2075 2080 

Gly His Lys Val Val lie His Ser Arg Ser Pro Ser lie Thr Ser Aorg 

2085 2090 2095 

Thr Ala Asp Val 
2100 



